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Amendments to the Claims : 

The following listing of claims will replace all prior versions, and listings, of claims in 
the application: 

1 . (Currently Amended) A method for identifying user types in a collection of 
connected content portions, comprising: 

determining at least one significant user path of connected content portions, 
said connected content portions being content portions connected to or linked to other content 
portions and reachable via a threshold number of traversals from an initial content portion; 

determining a multi-modal user path user information need for each at least 
one significant user path; 

for each content portion comprising each of the at least one significant user 

path, 

determining a multi-modal content portion feature information including at 
l e ast two of a content feature information and at least one o fi nformation, connection feature 
information, inward connection feature information and outward connection feature 
information, which for a selected content portion the multi-modal connection feature 
indicates a connection that appears on the selected content portion, the multi-modal inward 
connection feature fetere indicates a connection that refers to the selected content portion in 
the collection of connected content portions, and the multi-modal outward connection feature 
indicates a connection that is referred to by the selected content portion in the collection of 
connected content portions; 

combining each multi-modal content portion feature information for the user 
path with the multi-modal user path user information need into a multi-modal user path 
information; 
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determining a similarity function and a measure of similarity for the multi- 
modal user path information; 

determining a multi-modal clustering type; 

clustering the multi-modal user path information based on the multi-modal 
clustering type, the similarity function and the measure of similarity; and 

determining user types based on the clustered multi-modal user path 

information. 

2. (Currently Amended) The method of claim 23 claim 1 , wherein the multi- 
modal user path user information need is a multi-modal user path information need vector and 
the multi-modal content portion feature information is a multi-modal content portion feature 
vector. 

3. (Original) The method of claim 2, wherein determining significant user paths 
uses the longest repeating sub-sequences. 

4. (Original) The method of claim 2, wherein determining content feature 
information is based on weighted word frequency of each content portion. 

5. (Original) The method of claim 2, wherein determining the connection feature 
information comprises breaking the connection portion into constituent words using "/" and 
"." as word boundaries. 

6. (Original) The method of claim 2 5 wherein determining the inward connection 
feature information and the outward connection feature information further comprises 
normalizing the inward connection feature information and the outward connection feature 
information. 

7. (Original) The method of claim 2, wherein the similarity functions is based on 
determining the cosine between two multi-modal vectors. 
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8. (Original) The method of claim 2, wherein the multi-modal clustering type is 
at least one of K-means clustering, wavefront clustering. 

9. (Original) The method of claim 2, wherein each content portion in the user 
path is weighted using at least one of a content portion access frequency weighting, a 
weighting of the content portion based on content portion position in the user path. 

10. (Original) The method of claim 2, wherein each multi -modal feature vector 
may be independently weighted. 

1 1 . (Currently Amended) A system for identifying user types in a collection of 
connected content portions, comprising: 

a controller circuit, a memory circuit, and an input/output circuit; 
a multi-modal clustering type determining circuit that determines a multi- 
modal clustering type; 

a content determining circuit; 

a user path determining circuit that determines at least one significant user 
path of connected content portions, said connected content portions being content portions 
connected to or linked to other content portions and reachable via a threshold number of 
traversals from an initial content portion; 

a multi-modal user path user information need determining circuit that 
determines a user information need for each significant u ser path and the user information 
need includes a value that reflects a probability that a user will browse through a content 
portion in at least one significant user path, 

multi-modal conten t, multi modal connection, multi modal inward connection 
nnd multi modal outward conn e ctio n portion feature information determining circuits that 
determine multi-modal content information and at least one o f cont e nt multi-modal 



Xerox Docket No. D/A0A28 
Application No. 09/820,988 

connection, multi-modal inward connection and multi-modal outward connection feature 
information for each content portion comprised in a user path, which for a selected content 
portion the multi-modal connection feature indicates a connection that appears on the selected 
content portion, the multi-modal inward connection feature indicates a connection that refers 
to the selected content portion in the collection of connected content portions, and the multi- 
modal outward connection feature indicates a connection that is referred to by the selected 
content portion in the collection of connected content portion; 

wherein the controller combines each content portion multi-modal content 
portion content, multi modal connection, multi modal inward connection and multi modal 
outward connection feature information for the user path with the multi-modal user path user 
information need into a multi-modal user path information; 

a similarity function determining circuit for determining similarity between 
two multi-modal information; 

a multi-modal clustering circuit that clusters the multi-modal user path 
information based on the multi-modal clustering type, the similarity function and a specified 
measure of similarity; and 

a cluster analyzing circuit that determines user types based on the clustered 

multi-modal user path information. 

12. (Currently Amended) The system of claim 2 4 claim 11 , wherein the multi- 
modal user path user information need is a multi-modal user path information need vector and 
the multi-modal content portion feature information is a multi-modal content portion feature 
vector. 

1 3 . (Original) The system of claim 1 2, wherein the user path determining circuit 
determines significant user paths using the longest repeating sub-sequences. 
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14. (Original) The system of claim 12, wherein the multi -modal content feature 
information determining circuit determines words based on weighted word frequency of each 
content portion. 

15. (Original) The system of claim 12, wherein the multi-modal connection 
feature information determining circuit determines connection features by breaking the 
connection portion or link into constituent words using 7" and "." as word boundaries. 

16. (Original) The system of claim 12, wherein the multi -modal inward 
connection feature determining circuit and the multi-modal outward connection feature 
determining circuit normalize the inward connection feature information and the outward 
connection feature information. 

17. (Original) The system of claim 12, wherein the similarity function 
determining circuit determines similarity based on the cosine between two multi-modal 
vectors. 

1 8. (Original) The system of claim 12, wherein the multi-modal clustering type is 
at least one of K-means clustering, wavefront clustering. 

1 9. (Original) The system of claim 12, wherein each content portion in the user 
path is weighted by at least one of a content portion access frequency weighting circuit that 
weights the content portion based on access frequency, a path position weighting circuit that 
determines a weighting based on the position of the content portion within the user path. 

20. (Original) The system of claim 12, further comprising a multi-modal feature 
weighting circuit that weights each multi-modal feature vector independently. 

21-22. (Canceled) 

23. (Previously Presented) A method for identifying user types in a collection of 
connected content portions, comprising: 
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determining at least one significant user path of connected content portions; 

determining a multi-modal user path user information need for each at least 
one significant user path and the user information need includes a value that reflects a 
probability that a user will browse through a content portion in at least one significant user 
path; 

the probability being estimated using a spreading activation algorithm which 

generates a document vector A using the following formulas: 

A(l) = ALPHA * Matrix S * E (1) 
A(t) - ALPHA * Matrix S * A(t-l) + E (2) 
where the formulas are applied t times, the matrix S reflects a topology matrix, vector 

E reflects the user path, and ALPHA reflects the probability a user will browse through the 

content portion; 

for each content portion comprising each of the at least one significant user 

path, 

determining a multi -modal content portion feature information including a 
content feature information, connection feature information, inward connection feature 
information and outward connection feature information; 

combining each multi-modal content portion feature information for the user 
path with the multi-modal user path user information need into multi-modal user path 
information; 

determining a similarity function and a measure of similarity for the multi- 
modal user path information; 

determining a multi-modal clustering type; and 

clustering the multi-modal user path information based on the multi-modal 
clustering type, the similarity function and the measure of similarity. 
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24. (Currently Amended) A system for identifying user types in a collection of 

connected content portions, comprising: 

a controller circuit, a memory circui t, and a input/output circuit; 

a multi-modal clustering type determining circuit; 

a content determining circuit; 

a usage determining circuit; 

a topology determining circuit; 

a user path determining circuit that determines at least one significant user 
path of connected content portions; 

a multi-modal user path user information need determining circuit that 
determines a user information need for each user path and the user information need includes 
a value that reflects a probability that a user will browse through a content portion in at least 
one significant user path; 

the probability being estimated using a spreading activation algorithm which 

generates a document vector A using the following formulas: 

A(l) = ALPHA * Matrix S * E (1) 
A(t) = ALPHA * Matrix S * A(t-l) + E (2) 
where the formulas are applied t times, the matrix S reflects a topology matrix, vector 

E reflects the user path, and ALPHA reflects the probability a user will browse through the 

content portion; 

multi-modal content, multi-modal connection, multi-modal inward connection 
and multi-modal outward connection feature information determining circuits that determine 
multi-modal content, multi-modal connection, multi-modal inward connection and multi- 
modal outward connection feature information for each content portion comprising a user 
path; 



Xerox Docket No. D/A0A28 
Application No. 09/820,988 

wherein the controller combines each content portion multi-modal content, 
multi-modal connection, multi-modal inward connection and multi-modal outward 
connection feature information for the user path with the multi-modal user path user 
information need into multi-modal user path information; 

a similarity function determining circuit for determining similarity between 
two multi-modal information; and 

a multi-modal clustering circuit that clusters the multi-modal user path 
information based on multi-modal clustering type, the similarity function and a specified 
measure of similarity. 
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